Picture for Zhengzhong Tu

Zhengzhong Tu

Ben

Modular Safety Guardrails Are Necessary for Foundation-Model-Enabled Robots in the Real World

Add code
Feb 03, 2026
Viaarxiv icon

FASA: Frequency-aware Sparse Attention

Add code
Feb 03, 2026
Viaarxiv icon

Position: Human-Centric AI Requires a Minimum Viable Level of Human Understanding

Add code
Jan 31, 2026
Viaarxiv icon

BibAgent: An Agentic Framework for Traceable Miscitation Detection in Scientific Literature

Add code
Jan 12, 2026
Viaarxiv icon

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

Add code
Jan 04, 2026
Viaarxiv icon

FlowSteer: Conditioning Flow Field for Consistent Image Restoration

Add code
Dec 09, 2025
Viaarxiv icon

3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation

Add code
Nov 11, 2025
Viaarxiv icon

Background Fades, Foreground Leads: Curriculum-Guided Background Pruning for Efficient Foreground-Centric Collaborative Perception

Add code
Oct 22, 2025
Viaarxiv icon

SuperGen: An Efficient Ultra-high-resolution Video Generation System with Sketching and Tiling

Add code
Aug 25, 2025
Viaarxiv icon

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Add code
Jul 16, 2025
Figure 1 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Figure 2 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Figure 3 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Figure 4 for MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding
Viaarxiv icon